Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 0603720110170010024
Journal of Korean Society of Medical Informatics
2011 Volume.17 No. 1 p.24 ~ p.28
Evaluation of Co-occurring Terms in Clinical Doc-uments Using Latent Semantic Indexing
Han Choong-Hyun

Yoo Soo-Young
Choi Jin-Wook
Abstract
Objectives: Measurement of similarities between documents is typically influenced by the sparseness of the term-document matrix employed. Latent semantic indexing (LSI) may improve the results of this type of analysis.

Methods: In this study, LSI was utilized in an attempt to reduce the term vector space of clinical documents and newspaper editorials.

Results: After ap-plying LSI, document similarities were revealed more clearly in clinical documents than editorials. Clinical documents which can be characterized with co-occurring medical terms, various expressions for the same concepts, abbreviations, and typo-graphical errors showed increased improvement with regards to a correlation between co-occurring terms and document similarities.

Conslusions: Our results showed that LSI can be used effectively to measure similarities in clinical documents. In addition, correlation between the co-occurrence of terms and similarities realized in this study is an important positive feature associated with LSI.
KEYWORD
Information Storage and Retrieval, Cluster Analysis, Documentation
FullTexts / Linksout information
 
Listed journal information
ÇмúÁøÈïÀç´Ü(KCI) KoreaMed ´ëÇÑÀÇÇÐȸ ȸ¿ø